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INTRODUCTORY. 

All  knowledge  -  "beyond  that  of  bare  isolated  oocurrence- 
deals  with  uniformities.   Of  the  latter,  some  few  have  a  claim 
to  be  considered  absolute,  such  as  mathematical  implications 
and  meohanical  laws.   But  the  vast  majority  are  only  partial; 
medicine  does  not  t^ach  that  smallpox  is  inevitably  escaped  by 
vaccination,  but  that  it  is  so  generally;  "biology  has  not  shown 
that  all  animals  require  organic  food,  but  that  nearly  all  do  so; 
in  daily  life,  a  dark  sky  is  no  proof  that  it  will  rain,  but  merely 
a  warning;  even  in  morality,  the  sol%  categorical  imperative 
alleged  by  Kant  was  the  sinfulness  of  telling  a  lie,  and  few 
thinkers  since  have  admitted  so  much  as  this  to  be  valid  uni- 
versally.  In  psychology,  roore  perhaps  than  in  any  other 
science,  it  is  hard  to  find  absolutely  inflexible  coincidences; 
occasionally,  indeed,  there  appear  uniformities  sufficiently  reg- 
ular to  be  practically  treated  as  laws,  "but  infinitely  the  greater 
part  of  the  observations  hitherto  r-eoorded  concern  only  more 
or  less  pronounced  tendencies  of  one  event  or  attribute  to  accom- 
pany another. 

Under  these  circumstances,  one  might  well  have  expected 
that  the  evidential  evaluation  and  precise  mensuration  of  tend- 
encies had  long  been  the  subjeot  of  exhaustive  investigation 
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and  now  formed  one  of  the  earliest  sections  in  a  "beginner's  psy- 
ohologioal  course.   Instead,  we  find  only  a  general  naive  igno- 
rance that  there  is  anything  about  it  requiring  to  "be  learnt.   One 
after  another,  laborious  series  of  experiments  are  executed  and 
published  with  the  purpose  of  demonstrating  some  conreotion  be- 
tween two  events,  wherein  the  otherwise  learned  psychologist 
reveals  that  his  art  of  proving  and  measuring  eorrespondenoe 
has  not  advanced  beyond  that  of  lay  persons.   The  eonee- 
quence  has  been  that  the  significance  of  the  experiments  is  not 
at  all  rightly  understood,  nor  have  any  definite  facts  been 
elicited  that  may  be  either  confirmed  or  refuted. 

The  present  article  is  a  commencement  at  attempting  to 
remedy  this  deficiency  of  scientific  correlation.   With  this  view, 
it  will  be  strictly  confined  to  the  needs  of  practioal  workers,  and 
all  theoretical  mathematical  demonstrations  will  be  ommitted; 
it  may,  however,  be  said  that  the  relations  stated  have  already 
reoeived  a  large  amount  of  empirical  verification.   Great  thanks 
are  due  from  me  to  Professor  Haussdorff  and  to  Dr.  G.  Lipps, 
each  of  whom  have  supplied  a  maeful  theorem  in  polynomial 
probability;  the  former  has  also  very  kindly  given  valuable 
advioe  concerning  the  proof  of  the  important  formulae  for  elimi- 
nation of  "systematic  deviations." 

At  the  same  time,  and  for  the  same  reason,  the  meaning  and 
working  of  the  various  formulae  have  been  explained  suffi- 
ciently, it  is  hoped,  to  render  them  readily  usable  even  by 
those  whose  knowledge  of  mathematics  is  elementary.   The 
fundamental  procedure  is  aocompanied  by  simple  imaginary 
examples,  while  the  more  advanoed  parts  are  illustrated  by 
oases  that  have  actually  occurred  in  my  personal  experience. 
For  more  abundant  and  positive  exemplification, .the  reader  is 
requested  to  refer  to  the  under  oited  research,  whioh  is  entirely 
built  upon  the  principles  and  mathematical  relations  here  laid 
A  own. 

In  conclusion,  the  general  value  of  the  methodios  recom- 
mended is  emphasized  by  a  brief  critoism  of  the  beet  correla- 
tional work  hitherto  made  public,  and  also  the  important  wuest- 
ion  is  disousaed  as  to  the  number  of'Toa8es"required  for  an 
experimental  series. 

Part  1. 

ELEMENTARY  CORRELATION  AND  "ACCIDENTAL  DEVIATION." 

1.  Requirements  of  a  Good  Methof  of  Correlation. 

(a)   Quantitative  expression. 
The  most  fundamental  requisite  is  to  be  able  to  measure  our 
observed  correspondence  by  a  plain  numerioal  symbol.  There 


1.  "General  Intelligence"  determined  and  measured,  to  appear  in 
a  subsequent  number  of  this  Journal. 
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is  no  reason  whatever  to  be  satisfied  either  with  vague  general- 
ities such  as  "large, ""medium, '"small, "  or,  on  the  other 
hand,  with  complicated  tables  and  compilations. 

The  first  person  to  see  the  possibility  of  this  immense  ad- 
vanoe  seems  to  have  been  Galton,  who,  in  1886,  writes:  "the 
length  of  the  arm  is  said  to  be  correlated  with  that  of  the  leg, 
because  a  person  with  a  long  arm  has  usually  a  long  leg  and 
conversely."1  He  then  prooeeds  to  devise  the  required  symbol 
in  suoh  a  way  that  it  conveniently  ranges  from  1,  for  perfect 
correspondence,  to  0  for  entire  independence,  and  on  again  to 
-1  for  perfect  correspondence  inversely.   By  this  means,  cor- 
relations became  comparable  with  other  ones  found  either  in 
different  objects  or  by  different  observers;  they  were  at  last 
oaoable  of  leading  to  further  conclusions,  speculative  and  prac- 
tical; in  a  word,  they  now  assumed  a  scientific  character. 

Mathematically,  it  is  clear  that  innumerable  other  systems 
of  values  are  equally  oonoievable,  similarly  ranging  from  1  Jro  0. 
One  such,  for  instanoe,  has  been  worked  out  and  extensively 
used  by  myself  ( se^fcp  15  ff )  .  It  therefore  becomes  necessary 
to  discuss  their  relative  merits. 

(b)   The  significance  of  the  quantity. 

Galton' s  particular  system  is  defined  and  most  advantageously 
distinguished  from  all  the  others  by  the  important  property, 
that  if  any  number  of  arms,  for  instance,  be  collected  which 
are  all  any  amount,  )(.S"ou  9^ove  the  mean,  then  the  corresponding 
legs  will  average  rx<n  above  the  mean  (with  a  middle  or 
"quartile"  deviation(X)  of  $j  YI^-T^l  Jwhere  0*"co  ^r  the  quartile 
variation  of  the  arms,  5-,  _.  that  of  the  legs,  and  r  is  the  meas- 
ure of  the  correlation. 

But  another-  theoretically  far  more  valuable  -  property  may 
conoeivable  attach  to  one  among  the  possible  systems  of  values 
expressing  t&e  correlation;  this  is,  that  a  measuse  might  be 
afforded  of  the  hidden  underlying  oause  of  the  variations.  Sup- 
pose, for  example,  that  A  and  B  both  detive  their  money  from 
variable  dividends  and  each  gets  1/x  th.  of  his  total  from  some 
souroe  common  to  both  of  them.   Then  evidently  their  respec- 
tive inoomes  will  have  a  oertain  tendency  to  rise  and  fall  simul- 
taneously; this  correspondence  will  in  any  of  the  possible  sys- 
tems of  values  always  be  some  function  1/x,  but  in  only  one  of 
them  will  it  actually  be  itself  6  1/x;  in  suoh  a  favored  case,  if 
A  and  B  get,  say, 20$  of  their  respective  incomes  from  the 
oommon  source,  the  correlation  between  these  tut  imcomes 
will  also  show  itself  as  0.E0;  and  conversely,  if  A's  inoome 
happens  to  be  found  correlated  with  that  of  B  by  0.20,  then 


1. "Proceedings  Royal  Society  of  London",  Vols  XL  and  XLV. 
2.  Commonly,  but  misleadingly,  termed  the  'probable  error." 
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there  is  a  likelihood  that  0.20  of  A'e  income  coincides  with  0.20 
of  B,  leaving  to  either  0.80  disposable  independently.   The  ob- 
served correlation  thus  beoomes  the  diredt  expression  of  the  rel- 
ative amount  of  underlying  influences  tending  for  and  against 
the  eorrespondenoe. 

In  the  above  imagined  instance,  this  desirable  expressiveness 
belongs  to  the  same  above  system  of  values  proposed  by  Galton 
(  and  elabotated  by  Pearson).    But  this  instance  is  excep- 
tional and  fundamentally  different  from  the  normal  type.   Evi- 
dently, A  and  B  need  not  necessarily  derive  exaotly  the  same 
proportion  of  their  incomes  from  the  common  sourwe;  A  might 
get  his  0.20  while  B  got  some  totally  different  share;  in  whioh 
case,  it  will  be  found  that  the  correlation  is  always  the  geomet- 
rical mean  between  the  two  shares.   Let  B  be  Induced  to  put 
all  hj,s  income  into  the  common  fund,  then  A  need  only  put  in 
0.20^^0.04,  to  maintain  the  same  correlation  as  before;  since 
the  geometrioal  mean  between  0.04  and  1  is  equal  to  0.20. 

Now,  in  osychological,  as  in  most  other  actual  correspond- 
ences, A  and  B  are  net  to  be  regarded  as  in  the  fixed  bisection 
of  our  first  case,  but  rather  as  in  the  labile  inter-accommoda- 
tion of  our  second  case.  Hence  A,  in  order  to  be  correlated 
with  B  by  1/x,  must  be  considered  to  have  only  devoted  l/x2# 
(instead  of  1/x)  of  his  arrangement  to  this  purpose  and  there- 
fore still  have  for  further  arrangement  1.-  l/x^phich  w}ll 
enable  an  independent  oorreletion  to  arise  of  /  i-i/xy        In 
short,  not  Galton's  measure  of  correlation,  but  the  square  thereof, 
indicates  the  relative  influence  of  the  factors  in  A  tending 
towards  any  observed  correspondence  as  oompared  with  the 
remaining  components  of  A  tending  in  other  directions. 

(o)  Acourady. 

Prom  this  plurality  of  possible  systems  of  values  for  the 
measure  of  the  correlation  must  be  carefully  distinguished  the 
variety  of  ways  of  calculating  any  one  of  them.   These  latter 
again,  have  various  advantages  and  disadvantages,  of  thiarh 
the  principal  is  their  respective  decrees  0f  liability  to  'acci- 
dental deviation." 

Por,  though  the  correlation  between  two  series  of  data  is  an 
absolute  mathematical  fact,  yet  its  whole  real  value  lies  in  our 
being  able  to  assume  a  likelihood  of  fu**her  cases  taking  a  simi- 
lar direction;  we  want  tp  opnsider  our  results  as  a  truly  repre- 
sentative sample.   Any  one  at  all  aocustomed  to  original  inves- 
tigation must  be  aware  how  frequently  phenomena  will  group 
themselves  in  such  a  manner  as  to  convincingly  suggest  the 
existenoe  of  some  law  -  when  still  more  prolonged  experiment 
reveals  that  the  observed  uniformity  was  due  to  pure  hazard  and 
has  no  tendenoy  whatever  to  further  repeat  itself. 

Luckily,  this  one  great  souroe  of  fallacy  cah  be  adequately 
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eliminated,  owing  to  the  fact  that  suoh  accidental  deviations 
are  different  in  every  individual  case  (hence  are  often  called 
the  "variable  errors")  and  occur  quite  impartially  in  every 
direction  according  to  the  known  laws  of  probability.   The 
consequence  is  thet  they  eventually  more  or  less  completely 
compensate  one  another,  and  thus  finally  present  an  approxi- 
mately  true  result.  Such  elimination,  however,  must  always 
remain  theoretically  imeoraplete,  since  no  amount  of  chance 
coincidence  is  absolutely  impossible;  but  beyond  oertain  limits 
it  becomes  so  extremely  unlikely  that  for  practical  purposes  we 
can  afford  to  neglect  it.   ~'hen  a  person  loses  14  times  run- 
ning at  pitch-and-to88,  he  oan  reckon  that  suoh  a  series  would 
not  occur  by  mere  accident  one  in  9,999  times,  and  conse- 
quently he  will  feel  justified  in  attributing  the  coincidence  to 
some  constant  disturbing  influence.   Similarly,  to  estimate  the 
evidential  value  of  any  other  observed  uniformity,  we  only  re- 
quire to  know  how  nearly  the  odds  against  chance  coincidence 
have  approaohed  to  some  such  standard  maximum  as  9,999  to  1. 
But,  as  any  standard  must  always  be  more  or  less  arbitrary- 
some  thinking  it  too  lenient  and  others  unnecessarily  severe- 
it  is  usual  to  employ  a  formula  giving  not  the  maximum  but 
the  middle  deviation  or  "probable  error".   We  may  then  easily 
find  the  probability  of  mere  hazard  from  the  following  compara- 
tive table: 

If  the  observed  correlation 
divided  by  the  pboballe 

error  be.  ~1234        5       6 

then  the  frequency  of  occur- 
ence by  mere  hazard    -  fr    1/6  1/23  1/143  1/1250  1/19000 

How  ,  the  amallne8s  of  this  probable  error  depends  principally 
upon  the  number  of  cases  observed,  but  also  largely  upon  the 
mathematical  method  of  correlation.   Though  a  faultiness  in 
the  latter  respect  can  theoretically  be  made  good  by  increasing 
the  range  of  the  observations,  yet  such  inorease  is  not  always 
possible,  and,  besides, has  other  grave  disadvantages  which  will 
be  discussed  later  on.   Other  things  being  equal,  therefore, 
the  best  mexhod  is  that  _qne  which  gives  the  least  probable  error 
^or  the  benefit  of  the  "reader ""this  probable" "error  should  always 
be  plainly  stated;  nothing  more  is  required  than  a  rough  ap- 
prozimation;  for  while  it  is  highly  important  to  distinguish 
between  a  deduction  worth,  say,  0.9999  of  perfect  oertainty 
and  one  worth  only  0.75,  it  would  be  a  mere  splitting  of  straws 
to  oare  whether  a  particular  experiment  works  out  to  a  validity 
of  0.84  or  to  one  of  0.85. 

(d)  Ease  of  application. 


1.  In  the  proper  use  of  this  expression. 
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The  most  accurate  ways  of  calculation  are  generally  some- 
what difficult  and  slow  to  apply;  often,  too  there  ooour  cir- 
cumstances under  which  they  cannot  be  U3ed  at  all.   Henoe 
in  addition  to  a  standard  method,  which  must  be  used  for  finally 
establishing  the  principal  results,  there  is  urgent  need,  also  of 
auxiliary  methoc'8  capable  of  being  employed  under  the  most 
varied  conditions  and  wi th  the  utmost  facility. 

But  here  a  word  of  warning  appears  not  out  of  place.   For 
such  auxiliary  methods  are  very  numerous  ana  their  results, 
owing  to  aocidnets,  will  diverge  to  some  extent  from  one 
another;  so  that  the  unwary , "  self  -suggested''  experimenter 
may  often  he  led  unconsciously  -  but  none  the  less  unfairly  -  to 
pick  out  the  one  most  favorable  for  his  particular  point,  and 
thereby  confer  upon  his  work  an  unequi»ocality  to  which  it  is 
by  no  means  entitled.   Any  departure  from  the  rooognized 
standard  methods  are  only  legitimate,  eitner  when  absolutely 
necessary,  or  for  mere  preliminary  work,  or  for  indicating 
comparatively  unimportant  relations. 

2.   Standard  Methods  Explained. 

fa)   Correlations -between  variables  that  can  be  measured 
quantitatively. 

This  may  be  regarded  as  the  normal  type  of  correlation.   Its 
standard  method  of  calculation  is  thatgdiacovered  by  Sravais,1 
in  1846,  and  shown  by  Pearson  in  1896,   to  be  the  best  possible. 
Pearson  terms  this,  method  that  of  'Product  moments.  * 

The  formula  appears  most  conveniently  expressed  as  follows: 


Sxy 


r  - 


\J   3x*.  Sya 


'  such  products  for  all  the  individuals, 

of  all  the  various  values  of  x, 


where  x  and  y  are  the  deviations  of  any  pair  of  characteristics 
from  their  respective  medians, 
xy  is  the  product  of  the  above  two  values  for  any  3ingle  in- 
dividual, 
Sxy  is  the  sum  of 
Sx^-  is  the  sum  of  the  squares 
Sy2-  is  similarly  for  y 
and  r  is  the  required  correlation. 

A  simple  example  may  make  this  method  clearer.   Suppose 
that  it  was  desired  to  oorrelate  acuteness  of  sight  with  that  of 
hearing,  and  that  for  this  puppose  five  persons  were  tested  as  to 
the  greatest  dostenoe  at  which  they  could  read  and  hear  a  stand- 
ard alphabet  and  sound  respectively.   Suppose  the  results  to  be; 


1.  "Memoires  par  divers  savants"  T,  IX,  Paris,  pp255-332 

2.  "Phil.  Trans.,  U.S.,  London"  Vol.CLXXXVII,  A,  p. 164. 
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Person 
A 
B 
C 
D 
E 

then,  we  get 


Sight 

Hearing 

6ft. 

6ft. 

7 

11 

9  (me 

dian) 

12 

11 

10  (median) 

14 

X 

8 

y 

xy 

A 

- 

3 

-   4 

♦  12 

B 

- 

2 

4-  1 

-   2 

C 

0 

+  2 

0 

D 

+ 

2 

0 

0 

E 

+ 

5 

-   2 

-  10 

X2 

y2 

9 

16 

4 

1 

0 

4 

4 

0 

25 

4 

(   +12  — „ 

Sxy  =  (   -12  Sx^  ^  42  Sy  a  25 


4-  12  -  12 


so  that  r  *   i —  0,  and  there,  thus,  is  no  cor- 

'V  42  X  25         respondenee,  direct  or  inverse 
The  "probable  error'  between  any  obtained  correlation  and 
the  really  existing  correspondence  has  been  determined  by 
Pearson,  as  being  'with  suffieient  accuracy 'when  a  fairly 
large  number  of  cases  have  been  taken, 


1  -  r2 


3-  0.674506 


V"  n  (H-r2") 


For  discussion  of  correlation  between  characteristics  whose 
distribution  differs  considerably  from  the  normal  probability 
curve  as  regards  either  ''range",  or  !'shewness,"  reference 
may  be  made  to  the  works  below.    It  may  be  remarked  that 
the  method  of  "product  moments"  is  valid,  whether  or  not 
the  distribution  follow  the  normal  lav/  of  frequency,  so  long  as 
the  "regression"  is  linear. 

(b)   Correlation  between  characteristics  that  can  not  be  meas- 
ured quantitavely. 

In  the  example  quoted  by  Galton,  of  correspondence  betv/een 
the  length  of  arm  and  that  of  leg,  it  may  be  noted  that  the  cor- 
respondence is  proportional  quantitatively;  a  long  arm  has  a 
tendency  to  be  acoompanied  by  a  leg  not  only  long,  but  long 
to  the  same  degree.   Now,  in  many  cases,  such  proportionality 
is  by  the  nature  of  things  excluded;  a  printed  work  is  possibly 
remembered  better  than  one  heard;  but,  nevertheless,  we  cannot 
in  accordance  with  the  preceding  formula,  ascertain  whether 
degrees  of  visuality  are  correlated  to  retentiveness  of  memory, 
seiing  that  in  the  former  case  there  do  not  exist  any  degrees, 
a  word  being  simnly  either  seen  or  not  seen.   Perhaps  even 

1.  Udny  Yule:  "Proc.  R.S.  London,"  Vol.  IX,  p477 

Pearson:  "Phil.  Trans.  2.S.    London."  Vol.CLXXXV,  lA,p71; 

Vol  CLXXXVI,  1  A,  p343,  and  Vol.  CXCI  A,  p. 229. 

G.  Lipps:  'Die  Theorie  der  Collectivgegenstande, "  Wundt's 

Phil.  Stud.,  Vol.  XVII. 
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more  numerous  are  those  cases  where  proportionality  does  in- 
deed exist,  but  practically  will  not  admit  of  being  measured; 
for  instance,  it  is  probable  that  conscientiousness  is  to  some 
extent  a  hereditary  quality,  fret  we  cannot  well  directly  deter- 
mine whether  brothers  tend  to  possess  precisely  the  same 
amount  of  it,  owing  to  the  fact  that  we  cannot  exactly  measure 
it. 

In  all  such  cases  we  mAst  confine  ourselves  to  counting  the 
frequencies  of  ooexistence.  v*e  can  easily  find  out  how  often 
seen  and  spoken  words  are  respectively  remembered  and  for- 
gotten.  It  has  Droved  quite  feasible  to  divide  the  ohildren  of  a 
school  generally  into  "conscientious"  and"non-consoientious, " 
and  then  to  measure  how  much  brothers  tend  to  be  in  the  same 
division;   When  we  have  proved  this  simple  association,  we 
may  provisionally  assume  correlation  of  quantity  also;  that  is 
to  say,  if  the  "conscientious,"  generally  speaking,  have  a  par- 
ticular degreo  of  tendency  to  possess  brothers  likewise  'con 
8cientiou8,n  then  boys  with  exoessively  tender  soruples  will 
have  the  same  degreo  of  tendency  to  possess  brothers  with  sim- 
ilarly excessive  tenderness,  while  those  with  only  a  moderate 
amount  of  virtue  will  be  thus  correlated  with  brothers  also  of 
only  moderate  virtue;  further,  the  ethioal  resemblance  may  be 
expected  to  repeat  itself  in  oouBinB,  etc.  only  reduced  in  prop- 
ortion as  the  kinship  is  diminished. 

For  measurement  of  this  non-proportional  association,  a 
standard  method,  which  may  be  termed  that  of  "cross  mul- 
tiples," has  been  elaborated  by  Sheppard^,  Bramley-Moore, 
^ilon,  Lee,  and  Pearson.   The  formula  is,  unfortunately,  too 
long  and  complicated  to  be  usefully  quoted  in  this  place.  It 
will  be  found  in  the  under  cited  work2  together  with  its  prob- 
able error  as  determined  by  Tearson.3  In  praotioe,  it  will 
generally  have  to  be  replaced  by  one  of  the  more  convenient 
methods  to  be  next  described. 

3.  Comparison  by  Rank. 

This  method  of  "oross  multiples"  is  not  only  difficult  and 
tedious  of  application,  but  also  it  gives  a  probable  error  nearly 
double  that  of  "Produot  moments." 

How,  it  can  often  be  altogether  esoaped  in  the  case  of  quan- 
titiesnnot  admitted  absolute  measurement,  by  substituting  in- 
stead comparison.  This  other  way  will  be  discussed  at  some 
length,  as  it  has  been  largely  used  by  myself  and  is  believed 
chiefly  responsible  for  some  successful  experiments.  All  charac- 
teristics may  be  collated  from  two  quite  distinct  aspects:  either 
(as  in  example  of  visual  and  auditory  acuteness )  by  actual 


l."Phil.  Trans."  Vol.  CXCII,  A,  p. 141. 
2.  "•'hil.  Trans."  Vol.  GXCV  A,  pp. 2-7 
Z.  Phil.  Trans."  Vol.  CXCV  A,  10-14. 
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mensuaation,  or  else  "by  order  of  merit;  we  might  say  that  a 
student,  A,  obtained  0,000  marks  in  an  examination,  while  3 
only  got  6,000;  or  instead,  we  might  say  that  A  r-as  third  out 
of  100  candidates, while  3  was  only  20th.   Teoisely  the  seme 
method  of  calculation  may  he  again  used  in  the  latter  case, 
simply  substituting  the  inverse  ranks,  97,  80,  etc.,  for  the  per- 
formance 8,  8,000,  6,000,  etc. 

f a )   Disadvantages  of  the 'Hank"  m et ho d . 

In  the  first  olace,  it  may  be  objected  that  the  observed  cor- 
relation would  then  only  hold  good  for  persons  of  the  same 
average  differences  from  one  another.   For  assuming,  say,  acute 
sight  to  be  correlated  with  aoute  hearing;  then  the  order  of 
merit  of  A,  B  and  0,  as  regards  sight,  is  more  likely  to  remain 
unaltered  as  regards  hearing  also,  when  the  difference  in  tneir 
respective  powers  of  vision  is  extremely  marked,  thai  when 
they  are  practically  equal  on  the  latter  head.   3ut  the  more 
numerous  the  persons  experimented  on,  the  le3s  will  be  the 
average  difference  of  faculty;  it  might,  therefore,  be  supposed 
that  the  correlation  v?ould  become  continually  leas  perfect  as 
the  experiments  were  made  more  extensive.   This,  however, 
would  be  a  fallacy:  100  experimental  subjects  compared  to- 
gether by  "Hank"  would  on  the  whole  actually  show  appre- 
ciably the  same  average  correlation  an  1,000,  provided,  that  in 
either  case  the  subjects  are  selected  by  chanca;  the  amount  of 
the  correlation  is  not  really  dependent  upon  the  difference  be- 
tween the  grades,  but  upon  the  relation  of  this  difference  to 
the  mean  diviation:  and  noth  of  these  increase  together  with 
the  number  of  subjects;  2n   tho  other  hand,  the  correlation 
will  undoubtedly  diminish  if  the  subjects  be  all  chosen  for-T>  a 
more  homogeneous  class;  in  a  seleot  training  school  for  teacheBS, 
for  example,  general  intellirenoe  will  throughout  show  smaller 
correlation  with  othnr  qualities,  than  would  be  the  case  in  a 
college  for  quite  average  young  men  of  the  eeme  age;  but  this 
fact  applies  just  as  muoh  to  comparison  by  "Measurement. ' 
The  next  possinle  objeotion  is  that  comparison  by  rank 
bases  itself  uoon  an  assumption  that  all  the  oubjects  differ  form 
one  another  by  the  same  amount,  wherwas  A  amy  differ  form 
B  five  times  ar>  "uoh  as  3  differs  from  C.   But  such  an  assump- 
tion would  only  take  place,  if  correspondence  by  rank  were 
considered  to  be  wholly  equivalent  to  that  by  measurement; 
no  suoh  assumption  is  made;  the  two  aspeots  are  recognized  to 
be  theoretically  distinct,  but  advantage  is  taken  of  the  faot  that 
they  five  correlational  values  sencibly  equivalent  in  amount. 
iMB  against  the  small  existing  discrepancy  amy  be  set  off  a 
deviation  of  the  same  order  of  magnitude  which  is  incurred 
when  using  measurement  itself,  owing  to  the  oractical  necess- 
ity of  throwing  the  cases  into  a  number  of  groups. 
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The  thiBd  and  only  solid  objection  ia  that  rank  affords  a 
theoretically  somewhat  less  full  criterion  of  correspondence  than 
does  measurement;  and  the  force,  even  of  this  argument,  disap- 
pears on  considering  that  the  two  methods  give  apprecibaly 
the  same  correlational  values. 

( b )   Advantages  of  the  "Rank"  method. 

The  chief  of  these  is  the  large  reduction  of  the  'accidental 
error."   In  normal  frequency  curve,  the  outlying  exceptional 
oases  are  much  more  spaoed  apart  than  are  those  nearer  to  the 
average;  hence,  any  accident  disturbing  the  position  of  these 
exceptional  cases  will  have  unduly  great  effect  on  the  general 
result  of  the  correlation;  and  owing  to  this  inequality  in  the 
influence  of  the  errors,  the  letter  will  not  compensate  one 
another  with  the  same  readiness  as  usual.  Moreover,  it  is  just 
these  hyper-influential  extreme  eases  where  there  is  most  like- 
lihood of  accidental  errors  and  where  there  very  frequently 
nrevails  a  lew  quite  different  from  that  governing  the  great 
bulk  of  the  cases.  As  regards  the  qaantity  of  this  gain  by 
using  rank  (abstracting  from  the  last  mentioned  point,  which 
cannot  well  be  estimated  in  any  general  manner)  there  should 
be  no  difficulty  in  calculating  it  mathematically.   Prom  u.   con- 
siderable amount  of  empirioal  evidenoe,  the  probable  error  when 
using  the  method  of  'product  mements"  with  rank  appears 
to  become  lesB  than  two-thirds  of  that  given  by  the  same  method 
with  measurement,  ana  therefore  only  about  one-third  of  that 
given  by  the  method  of  "cross  multiples." 

The  ne-xt  advantage  it  that  rank  eliminates  any  disparity 
betve^n  the  two  characteristic-  compared,  as  regards  their 
general  system  of  distribution;  such  a  disparity  is  often  not  in- 
trinsic or  in  any  way  relevant,  but  merely  an  effect  of  the  par- 
ticular manner  of  gaining  the  measurement.   By  means  of 
rank,  e  series  presenting  the  normal  frequency  curve  can  be 
compared  on  even  terms  with  another  series  whose  curve  in 
entirely  different.   This  cannot  well  be  cone  when  us±±g 
measurements.   (See  p. 7). 

Btak  has  also  the  useful  property  of  allowing  any  two  aeries 
to  be  saelly  and  fairly  combined  into  a  third  composite  one. 

( o )   Cone  ixis  ion. 

Pron  the  practical  noint  of  view,  it  is  so  urgently  desirable 
to  obtain  the  smallest  nrobable  error  with  a  given  number  of 
subjects,  that  the  ruethoo.  of  rank  must  often  have  the  prefer- 
ence even  when  we  are  dealing  with  t.vo  series  of  measure- 
ments prooerly  nornuorable  with  on*  -nother. 

Theoretically,  rank  is  at  any  rate  ^referable  to  such  a  hybrid 
and  unmeaning  correlation  as  that  between  essential  measure- 
ments on  the  one   eioe  and  mere  arbitrary  clusai Cio u tion  on  the 
other.   ,'s  the  latter  occur  in  most  psychological  correlations, 
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the  only  other  resource  would  he  to  avoid  measurements  alto- 
gether hy  using  the  method  of  'cross  multiple.''   But  this 
trehles  the  size  of  the  probable  error,  end  therefore  renders  it 
necessary  that  the  subjects  should  be  no  less  than  nine,  times  as 
numerous;  such  an  enormous  Increase,  even  if  possible,  would 
generally  be  accompanied  by  disadvantages  infinitely  outweigh- 
ing the  supposed  theoretical  superiority  of  method. 

The  above  advantages?  are  still  further  enhanced  whenever 
dealing  with  one-eioe^  frequency  carves,  such  as  are  furnished 
by  most  mental  tests.   ?or  in  these  cases  the  great  bulk  of 
influence  u^on  the  resulting  correlation  is  derived  exclusively 
from  the  very  worst  performances  and  is  consequently  of  a 
specially  doubtful  validity. 

In  short,  correlation  by  rank,  in  most  cases  a  desirable  pro- 
cedure, is  for  Ehort  soriss  quite  indispenable,  rendering  them 
of   equal  evidential  value  to  much  longer  ones  treated  by  other 
ways.   Luckily,  it  is  preolsely  in  short  series  that  gradation 
by  rank  is  pracic^lly  attainable. 

(4)   Auxiliary  Methods, 
^hene,  as  has  been  said,  are  only  for  use  vhen  there  is  ade-  - 
quate  roaeon  for  not  employing  the  above  "standard"  methods. 
Any  number  are  devisable.   Their  resulting  correlational  values 
do  not  quite  coincide  with  those  found  by  the  standard  says, 
but  nearly  «nough  so  for  roos-it  oractical  purposes. 

(a)   Auxiliary  methods  of  Pearson 

;:,overal  very  ingenious  t*.nd  convenient  ones  are  furnished  by 
him,-1-  but  all  of  similar  type  and  requiring  the  s^me  date  as 
that  of  "cross-multiole. "2   They  hro  therefore  for  uhr  when 
the  oomparel  events  do  not  admit  of  direct  quantitative  corre- 
lation.  The  following  appears  to  combine  facility  and  precision 
to  the  greatest  degree: 

r  -  sin   2 

where  the  two  compared  series  of  characteristics,  say  P  and  Q 

each  divided  into  two  (preferably  about  equal)  classes;  if 
thn  case  is  one  where  quantity  exists  but  cannot  be  absolutely 
measured,  P  II  will  comprise  the  instances  1n  "vhich  ^  is  in 
manifest  aefloiency;  hut  if  the  ompared  characteristics  essen- 
tia} ly  exclude  quantity,  ?   it  becomes  the  instannas  where  n   is 


1.  "Phia. Trans.  R.f-.T,.,"  Vol.  CXC7,  A,  pp  1  and  79. 

ad  7b° 

2.  They  nra   all  refinements  of  the  original  formula,  r  -  ad  -f-  be 

oublished  by  Yule,  Proc.  3.3.L.,"  Vol.LXVI,  0.23. 
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absent;  similarly  Q.  Then, 

a  -*  the  naznber  of  times  that  P  I  is  accompanied  by  ti  I 
b  *     *        n  "   P  II       "       "  Q  I 

C  '»       "  ■         "PI  "         r'   Q  II 

d  "     ■        "       "   P  II       ■       "  Q  II. 

If  a -+- b  is  not  very  unequal  to  o|  d,  the  probable  error 
may  be  taken  at  about  1.1/Vn,  where  n  —  the  number  of  in- 
stances in  the  whole  of  P  or  of  Q.1 

Returning  to  our  previous  illustration,  suppose  that  it  was 
desired  positively  to  ascertain  the  merits  cf  instruction  by  writ- 
ing and  by  word  of  mouth  respectively.   Ten  series,  each  con- 
sisting of  ten  printed  words,  have  been  successively  shown  to  a 
olass  of  twenty  children,  who  each  time  had  to  write  down  by 
memory  as  many  as  they  could.   The  experiment  was  next 
repeated,  but  reading  the  woud3  aloud  instead  of  showing  them. 
Of  the  2,096  visual  impressions  900  were  corectly  remembered, 
while  of  the  same  number  of  auditory  ones  only  700  were  re- 
tained. 

Gall  the  visual  impressions  P  I 

-  '  auditory    "       P  II 

"    ''  remembered  Q  I 

"    "  forgotten    "       Q  II 

then  a  -  900,  b  -  700,  c  -  1,100,  I  -  1,300,  and 
T-alnJE-     — ^---^-Y- 1 0.16 

2  a^975o"  yL.soof  y~7"o"o"    yl.ioo 

the  probable  error  then  comes  to  1.1/  4  000  "  nearly  0.02, 
or  about  1/8  of  the  above  correlation;  so  that  The  latter  would 
not  occur  by  mere  chance  once  in  100,00  times. 

We  thus  see  that  there  is  at  any  rate  good  prima  facie  evi- 
dence of  some  superiority  on  the  part  of  the  visual  sense.  Also, 
if  the  experiment  has  been  fairly  executed  and  adequately  de- 
scribed, any  subsequent  verification  under  sufficiently  similar 
conditions,  by  other  experimenters,  should  resllt  in  a  concordant 
correlation,  probably  between  0.04  and  0.28. 

Moreover,  we  have  obtained  a  direct  estimate  of  the  impor- 
tance of  this  apparent  superiority  of  the  visual  sense;  for  the 
square  of  the  correlation  amounts  to  0.025;  so  that  of  the  vari- 
ous oauses  here  tending  to  make  the  children  remember  some 
words  better  than  others,  the  difference  of  sense  impressed 
comes  to  about  one  fortieth  part  (see  p.  4). 


1.  More  accurately,  sin  0.1686^  (1  -  r2)./l   11   1 

Y    a^To^c^d" 


■ 


13. 


f b )   Method  of  proportional  charges. 

This  is  very  often  convenient,  being  especially  applicable  to 
a  lerge  number  of  psyohologioal  experiments,  and  so  easy  that 
the  result  can  be  approximately  seen  on  inspection.   Here, 

r  ■   3  .a  -  b  X   *«1 
2~  a  -  b 

where  a  -  the  number  of  oases  that  have  changed  in  accord- 
ance with  the  supposed  correspondence,  and  b   the  number 
th.*it  have  changed  in  contradiction  of  it.   The  probable  error 
again  comes  to  1.1 

Suppose,  for  example,  wo  were  demonstrating  that  intellect- 
ual fatigue  may  be  satisfactorily  investigated  by  the  method  of 
Qriessbach.^     th  this  view,  wo  have  applied  his  test  to  100 
boys  before  and  a£ter  their  lessons.   In  the  latter  case  68  of 
thoro  have  presented  the  expected  duller  sensitivity,  but  32,  on 
the  contrary,  have  shown  a  finer  discrimination  than  before 
work. 

How,  clearly,  na<i  the  enrresgonclenoe  been  perfeot,  all  the 
hundred 


clearly,  nart  the  correspondence  been  perfeot, 
would  hve  become  \70--se.*  Thus, 


3  69  -  32   -  0.54. 

r  -   2"  "  100 

As  tne  probable  error  corses  to  0.11,  our  imaginary  correla- 
tion is  five  tinea  greater,  and  therefore  would  not  have  occurred 
by  mere  aocident  more  than  once  in  1,250  times;  so  that  we 
beoeme  nreotioslly  certain  tha  +  the  sensitivity  of  the  skin  realty 
floep  measure  fatigue. 

It  now  becomes  oafy  to  ensnare  the  quantity  of  this  fatigue 
at  different  stages  of  work.   Let  us  say  that  further  experi- 
ments, after  lessors  lasting  one  hour  longer  than  before,  showed 
the  correlation  had  risen  to  0.77.   Thereby  we,  see  that  the  in- 
fluence of  fatigue  swells  from  0.542  to  0.77,^  that  is,  from  being 
1/5  to  being  3/5  of  all  ihe  sources  of  variation  in  cutaneous 
pensitivity.   8uch  e  result  has  e  very  different  scientific  sig- 
nificance from,  say,  any  conclusion  that  the  average  sensory 
threshold  had  enlarged  by  so  many  more  millimetres. 


1.  Hence,  when  the  correlation  is  very  complete,  say  over  0.75 
the  above  formula  gives  appreciably  too  large  values;  as  the 
amount  reaches  0.90  and  1,  the  first  faotor  must  be  reduced  from 
3/2  to  5/4  and  1  respectively. 

2.  This.as  is  well  known,  consists  in  determining  the  least 
distance  apart  at  which  two  points  of  oontact  oan  be  distinguis- 
hed as  being  double  and  not  single. 

O.       A  a  cpnwH  r>»  +V>  n  +      4  a      +n      a  r>  it  4-Vi  ■  +■      oil       4-V.  n      Vnito      YiannvnA     £atlCHAri      V>tT 
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Moreover,  our  t-at  can  be  easily  &nd  precisely  compered  with 
any  of  the  various  other  recommended  urqoeaurea,  being  more 
reliable  than  all  which  present"  smaller  correlations  and  vice 
versa . 

f o)   Method  of  class  averages. 

It  often  happens  that  measurements  (  or  ranks)  are  known 
but  not  in  such  a  way  as  to  be  eble  to  use  either  the  method  of 
"product  moments''  or  even  any  of  the  methods  of  Pearson. 
Under  such  circumstances,  I  have  found  it  very  useful  to  be 
able  to  apply  the  following  relation: 

r   -   d         id 
"  5 
where  d  is  the  observed  difference  between  the  average  meas- 
urement (  or  rank)  of  the  P's  aooompanied  by  Q  I  and  that  of 
those  accompanied  by  Q  II,  and  D  ia  the  greatest  difference 
that  T^as  possible  (such  as  would  have  occurred,  had  the  cor- 
respondence been  perfect).  If  Q  has  been  divided  into  two 
about  equal  portions,  D  will  be  eoual  to  twice  the  middle 
or  "quartile"  deviation  from  the  average  in  the  whole  series  P; 
while  if  Q  has  been  divided  after  tfte  usual  fashion  into  three 
such  portions,  only  the  two  outer  onea  can  be  uaed  and  then 
D  £  2.87  times  the  above  middle  deviation  (again  taken  in 
the  whole  series  P) 

Suppose,  for  example,  that  we  winh  to  ascertain  whether  the 
well  known  teBt  of  "reaction-time"  givea  any  indication  as  to 
the  oerson's  general  soeed  of  movement.   We  try  a  hundred 
persons  both  in  reaction-time  and  in  speed  of  running  50  yards. 
Then  we  divide  the  reaction-time  records  into  two  classes,  I 
containing  all  the  quickest  performers  end  II  al  the  slowest. 
We  now  see  how  long  these  two  classes  of  reacters  took  respec- 
tively to  run  the  fifty  yards,  and  what  was  the  middle  deviation 
from  the  average  among  all  the  runners  taken  together.   Let 
us  out  the  average  of  class  I  at  6  seconds,  that  of  class  II  at 
6.5  seconds,  and  the  general  middle  deviation  at  1.1  seoonds. 
Then 

6.5  -  6 
r-     Ex  1-1      Z.  °-2J5 

The  evidential  value  of  the  result  is  given  approximately, 
even  for  small  values  of  n,  by  the  following  relation: 


"fVoh&ble  error  -  1 .  17    ^^rT  -f-  1 


where  n  is  the  total  number  of  oases  considered.  In  the  three- 
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fold  instead  of  twofold  division,  the  probable  error  beoomes 
nearly 

1.4      fn_-  1 
,  n     "V^T-  2 

In  the  above  instance,  we  find  "that  the  observed  correlation 
is  little  over  double  the  probable  error;  as  so  much  would  turn 
up  about  once  in  six  times  by  mere  accident,  the  evidence  is 
not  at  all  conclusive.   Therefore  we  must  either  observe  many 
more  cases  -  600  would  be  necessary  to  reduce  the  probable 
error  to  l/5th  of  the  correlation  -  or  else  we  must  fon<*  a  better 
method  of  calaulation.   If  rank  had  been  employed  instead  of 
measurement,  the  evidence  would  already  have  been  fairly 
good,  and  could  have  been  put  beyond  all  reproach  by  the 
addition  of  another  150  observations.   If  rank  had  been  em- 
ployed in  conjunction  with  the  methof  of  "product  moments" 
or  that  of  "rank  differences,"  the  required  smallness  of  proba- 
ble error  oould  have  been  obtained  by  as  few  as  36  cares  in  allf 

The  method  of  "class  averages"  is  especially  valuable  in 
deciphering  the  results  of  other  investigators,  where  the  average 
performances  and  the  middle  deviations  are  usually  given  (in 
good  work),  but  not  the  data  required  for  any  of  the  other  methods. 

(d)   Method  of  rank  differences. 

This  methoff  ap*pears  to  deserve  mention  also,  seeing  that  it 
seems  to  unite  the  facility  of  the  auxiliary  methods  with  a 
maximum  accuracy  like  that  given  by  "pBOduct  moments". 
It  depends  upon  noting  how  much  each  individual's  rank  in 
the  one  faculty  differs  from  his  rank  in  the  other  one;  evidently 
this  will  be  nil  when  the  correlation  is  perfect,  and  will  increase 
as  the  correlation  diminishes. 

T~.        This  general  idea  seems  to  have  been  first  due  to  Binet  and 
Henri  ("La  fatigue  intelleotuelle''  p. 252-261),  who,  however 
do  not  work  it  out  far  enough  to  obtain  any  definite  measure 
of  correlation.   Accordingly,  Binet  makes  little  further 
attempt  in  later  research  (L'annee  psychologique, Vol. IV) 
to  render  it  of  service,  and  soon  appears  to  have  altogether 
dropped  it  (L'annee  psychologique, Vol.  VI.). 

The  same  isea  occurred  to  myself  and  was  developed  as 
above,  without  being  at  the  time  acquainted  with  the  prev- 
ious work  in  this  direction  by  Binet  and  Henri.  In  obtaining 
the  above  formulae  I  was  greatly  assisted  by  Dr.  G.  Lipps ' 
showing  generally  that  when  an  urn  contains  n  balls  numbered 
l,2,3,..n,  respectively;  and  when  they  are  all  drqwn 
in  turn  (without  being  replaced);  and  when  the  difference 
is  each  time  noted  between  the  number  on  the  ball  ahd  the 
order  of  its  drawing;  then  the  most  probable  (  or  middle) 
total  sum  of  8uoh  differences,  added  together  without 
regard  to  sign,  will  be 

n2  -  1 
3 
Previously  I  had  only  calculated  tfcis  value  for  each  parti- 
cular size  of  n  required  by  myself,  Prof .Hausdorff  further 


showed,  generally,  that  such  sum  of  differences  will  present 
a  mean  square  deviation  (from  the  above  most  probable  value) 

~TrTT"l)'(2n2  +  7) 
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The  relation  is  as  follows: 

R  r  1  -   3Sd 


tf^^l 
where  Sd  is  the  sum  of  the  differences  of  rank  for  all  the  indi- 
viduals , 

n  is  the  total  number  of  individuals, 
and  B  is  the  required  correlation. 

The  probable  error  will  then  be  approximately,  even  for 
small  values  of  n,  ^  0.4/^n. 

To  take  again  the  example  from  p. 9,  we  number  the  five 
persons  aocording  to  their  order  of  merit  in  hearing  and  seeing 
respectively. 


person 

seeing 

hearing 

difference 

rank 

rank 

A 

1 

1 

0 

1 

2 

4 

2 

C 

3 

5 

2 

D 

4 

8 

1 

E 

5 

2 

3_ 

3d  -  8 


so  'that 


R  -  1  -  3 8 

25-1 


and  again  we  find  that  there  is  no  correlation,  direst  or  inverse. 

This  method,  though  very  accurate  and  pre-eminently  quick 
in  application,  has  unfortunately  four  serious  disadvantages. 

It  oan  be  only  used  for  ranks,  and  not  immediately  for  meas- 
urements. 

The  probable  error  given  is  only  that  showing  how  great 
correlations  may  be  expected  from  pure  accident  when  there  is 
no  really  existing  correspondence  between  the  two  characteris- 
tics.  It  does  not  (like  ^earson's  probable  error  for  the  method 
of  "product  moments")  direotly  show  how  much  the  observed 
correlation  may  be  expected  to  differ  by  accident  from  any  cor- 
respondence, that  does  exist. 

The  various  possible  values  of  Sd  are  found  to  fall  into  a 
frequency  curwe  of  marked  asymmetry;  so  that  we  cannot  (as 
in  all  the  other  methods  here  given)  take  the  minus  values 
of  R  as  representing  so  nmch  inverse  correlation.   This  defeot 
could  be  remedied  nathematioally;  but  there  are  also  other 
respects  in  which  this  side  of  the  frequenoy  olrwe  appears  un- 
suitable for  our  purpose,  sq  that  it  is  better  to  treat  every  oor- 

T~.      This  formula  becones  slightly  incorrect,  whenever  two  or  more 
individuals  are  bracketed  as  having  precisely  the  same  rank;  but  the 
consequent  error  is  usually  to  be  worth  considering. 
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relation  as  positive  (which  can  always  bo  don©  by,  If  necessary, 
Inverting  the  order  of  one  of  the  series). 

Finally,  this  value  R  Is  n6t  numerically  equivalent  to  the 
"r"  found  by  all  the  other  methods,  but  for  chance  distribu- 
tions appears  ZyJ     r*.  So  far,  the  proof  of  this  relation  1?  only 
empirical,  btit  1l  resfis  on  a  large  number  of  cases  taken,  how- 
ever, only  between  o.20  and  0.60.   If  It  be  accepted  r  can  at 
once  be  found  from  the  following  table; 
R  .0.05  0.10  0.20  o.30  0.40  0.50  0.60  0.70  0.80  0.90  1 
r  0,13  Q,P-2   0.34  0.44  0.54  0.63  0.71  0.79  0.86  0.93  1 

Part  II. 
Correction  of  "Systematic  Deviations. " 

1. Systematic  Deviations  Generally. 

In  the  first  part,  we  have  seen  that  any  correlational  experi- 
ments however  extensive,  can  only  be  regarded  as  a" sample" 
out  of  the  Immense  reality,  and  will  consequently  present  a 
certain  amount  of  accidental  deviation  from  the  real  general 
tendency;  we  have  further  seen  that  this  accidental  deviation 
Is  measurable  by  the  "probable  error"  whose  determination, 
therefore  becomes  an  indlspensab;e  requisite  to  all  serious 
research. 

But  now  we  are  tn  danger  of  falling  from  Scylla  into  Charyb- 
dls.   7or  after  laboriously  compiling  sufficient  cases  and 
conscientiously  determining  the  probale  error,  there  exists  a 
very  human  tendency  to  cease  from  labor  and  inwardly  rejoice 
at  having  thus  risen  from  common  fallacious  argument  tothe 
serene  certainty  of  mathematics.  But  whether  or  not  such 
complacency  may  be  justifiable  in  pure  statistical  inquiry,  it  is 
at  any  rate  altogether  premature  in  the  kind  of  research  that 
e  are  at  present  contemplating;  we  are  not  dealing  with  sta- 
tistics, but  with  a  line  of  work  so  fundamentally  different  that 
it  ma-"-  be  aptly  distinguished  by  the  term  of  "statlstlcoids. " 
Here  the  accidental  deviation  Is  not  the  sole  one,  nor  even  the 
most  momentous;  there  are  many  other  enemies  who  are  un- 
moved by  the  most  formidable  array  of  figures.  These  consist 
in  such  deviations  as,  Instead  of  merely  being  balanced  lmper- 
perfectly,  lie  wholly  pm  tfle  one  side  or  the  other.   As  in  ordin- 
ary measurements,  so  too  In  correlation,  we  may  speak,  not  only 
of  "accidental"  "  variable"  or  "compensating"  inaccuracies, 
but  also  of  "systematic, 2  "constant,"  or  "non-compensating"  ones. 

These  systematic  deviations  are  of  very  varied  nature,  the 
Tost  insidious  being  as  tsual  self  suggestion.  To  tnke,  for 
instance,  one  of  our  recent  examples,  suppose  that  we  have 
applied  the  (Irlessbach  test  to  a  mxmber  of  children  before  and 
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after  their  lessons,  and  have  found  the  desired  correlation  be- 
tween fatigue  and  cutanuous  insensltlvlty  It  still  remains  ex- 
ceedingly difficult  to  convince  ourselves  that  we  executed  our 
tests  entirely  without  favor  or  affection;  for  it  Is  almost  Im- 
pos.ible  to  determine  a  series  of  pensory  thresholds  without 
some  general  tendency,  either  to  bring  them  towards  the 
desired  shape,  or  else  -  endeavoring  to  escape  such  bias-  to 
force  them  in  the  opposite  direction.   To  convince  others  of 
our  impartiality  may  be  harder  still.  Even  this  sort  of  devia- 
tion is  to  be  remedied  by  our  proposed  exact  method  of  pro$> 
cedure  for  by  It  we  obtain  perfectly  definite  results  which  ^pp 
any  impartial  experimenters  may  positively  corroborate  or  refute. 
2.  'Attenuation"  by  Errors. 
Prom  page  3  it  will  be  obvious  that  a  correlation  does  not 
simply  depend  on  the  amount  of  concording  factors  in  the  two 
compared  series,  but  solely  on  the  proportion  between  these        i 
§oncording  elements  on  the  one  hand  and  the  discording  ones       I 
on  the  other.   In  our  examplem  it  did  not  matter  whether  A 
and  B  each  had  one  pnnnd  or  a  thousand  pounds  in  the  com- 
mon funds  but  only  whether  the  amount  was  a  small  or  large 
fraction  of  their  "hole  Incomes.  If  the  discordance  ,1-x,  be 
nil,  then  the  concordance  x  is  thereby  perfect,  that  Is,  -  lj 
and  if  the  Influence  of  the  discordant  elements  be  sufficiently 
increased,  then  any  concordance  will  eventually  become  infi- 
nitely small. 

To  consider  a  still  more  concrete  example  suppose  three 
balls  to  be  rolled  along  a  well-kept  lawn;  then  the  various  dis- 
tances they  go  will  be  almost  perfectly  correlated  to  the  various 
forces  with  which  they  were  labelled.  Put  let  thefte  balls  be 
cast  with  the  same  inequalities  of  force  down  a  rough  moun- 
tain ride;  then  the  respective  distances  eventually  attained 
will  heve  but  faint  correspondence  to  the  respective  original 
momenta . 

Thus  it  will  be  clear  that  here  the  acoidental  deviations  have 
a  new  consequence  simultaneous  with,  but  quite  distinct  from,-, 
that  discussed  in  the  last  chapter.  For  there,  they  Impartially 
au^-nented  and  diminished  the  correlation,  tending  in  a  pro- 
longed series  to  always  more  and  more  perfectly  counterbalance 
one  another;  and  in  ordinary  measurements,  this  is  their  sole 
result.   But  here  in  correlations,  they  also  have  this  new  effect 
which  is  always  in  the  direction  of  "attenuating"  the  apparent 
correspondence  and  whose  amount,  depending  solely  on  the 
size  of  the  middle  error,  cannot  be  in  the  least  elininated  by 


I.  ^hle  fact  har  already  been  mathematically  expressed  in  the 

last  chapter  bv  the  value  of  correlation  between  two  series  being 
proportional  (Inversely)  to  the  value  of  the  middle  deviation 
Inside  the  series  (see  p.  15). 


■w4  no  J 


Jon 


- 


over! 

#0 


19 


any  prolongation  of  the  rorles.  The  deviation  &a  thus  become 
general  or  "ayste natlo. " 

Now  suposse  that  we  wish  to  ascertain  the  correspondence 
between  a  series  of  values,  p,  and  another  series,  q.   By  prac- 
tical observation  wo  evidently  do  not  obtain  the  true  objective 
values,  p  and  q,  but  only  approximations  which  we  will  call  p' 
and  q'.  Obviously,  p'  la  loss  oloaely  connected  with  q'  than  is 
p  wj.th  q.  for  the  first  pair  only  correspond  at  all  b-  *the  inter- 

\atlon  of  tho  Becond  p«  lr;  the  r  al  con  ej  pondence  between 


p  and  q  short!;;  rpt^,h?s  been  "attenuated  "  into  r    ,    •. 

To  ascertain  the  amount  of  thl*  attenuation  tad  thereby  dis- 
cover the  true  correlation  It  appear"  necersa-ry  to  make  two  or 
Tiore  independent  r<  rier.  of  observations  of  U^th  o   and  ~.  Then 


rpq 


rp'q' 


r 


r 


Y  Vf«  '  rqV 

where* p'q'  —  the  mean  of  the  correlations  between  ?a6h  Rerlea 

of  values  ofct' ir.ed  for  p  with  each  serlea  obtained 
for  q. 

d'd' 

p  y  —  the  average  correlation  between  and  another 

of  those  several  Independently  obtained  aeries  of 

values  for  p. 

r    ,    ,        the   same  as   regards  q. 

q  q    z 

and     rpq       _     the   required  real   correlation  between  the    true 
objective  values   of  p  and  q. 

Tims,    if  for  each  charsct*  r  <    tic   two   such  independent      reries 
of  observations   be    -r>.ade ,  iPc^l  anci  Qg   then   tie   true 

rpq  z    _*piqi  4  rpiqa  4.  rp^qi  ±.  rp2qa 


4  J         (rplpJ2  X     r<ilq2    > 

Snould  circumstances  happen  to  render  say  pi  BUeh  lore 
accurate  tnan  p£  then  the  eorrolatlona  involving  pi  will  be 
considerably  . raater   than   those   Involving  pJJ.      In    such  ca;;    , 

tne  numerator   of   the  above      fraotlo       nx.-t  be    formed  by  the 
eometri   al   Instead   or  b"  the   arithmetical  mean;    her  by  the 
accidental  error-   of    the  reapaotlve   observations   cease   to  elim- 
inate  one  another  and  thereforo   double    their  final   'nflur.nce;    U.ey 
also    introduce   an  undue  di-iinufcion  of   the   fraction. 

In  so~ie   exceptional  and  principally  very  theoretical  oaaen,    it 
may  happen  t.et  either  of  the  actual  leaasurenentft $   pay  p'l  lr. 


1     3y  an  inversion     of  the  above    for^ule,    the   correlation  be- 
tween  two   series   of   oheravatlons  will   be   found  a  useful 
measure    of   the   accuracy   of   the   observation. 
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*  d.      Then,  the  correlation  rpiq.  wln  be  t 
extent  Increased  without  any  proportional  increase  in  rl*J 


lue, 
ries 


hence  our  above  formula  will  fallaciously  present  too  large  a  va 

-«*♦./« er*?ter  Prac4ca}-  difficulty  is  that  of  obtaining  two  ser 
sufficiently  independent  of  one  another.  For  many  errors 
are  likely  to  repeat  themselves;  even  two  separate  observers 
ere  generally,  to  some  extent  warped  by  the  same  Influences; 
we  are  all  imposed  on  by,  not  only  the  "Idola  P.pecus,"  but 

alsothe  "Idola  Tribus"  and  the  "Idola  Fori."  In  ruch  ease* 

the  above  formula  is  still  valid,  only,  its  correction  does  not  go 

quite  far  enough,  -  a  fallacy  at  any  rate  on  the  right  side. 

An  actual  instance  will  best  show  the  urgent  necessity  of 

correcting  this  attenuation.   Tn  a  correlation  between  two 

events,  say  P  and  Q,  I  obtained  three  independent  observations 

both  of  P  and  of  Q.   The  average  correlation  for  those  of  P 

w.lth  those  for  Q,  was  o.38  (j;  rpiqi)j  the  average  correlation 

of  those  for  P  with  one  another  was  0.58  (^  rDi  i\.  the  same 

for  Q  was  0.22  (j;  r  i  i^  Therefore,  the  correspondence 

,  P  and  Q,  comes  by  recbonlng  to 

approximately  1;  s«  thai  the  corre;  pondenc 


between  the  real  events,  P  and  Q,  comes  b^  recbonlng  to 
0.38 


r 


6.58  X   0.22 

fn^teii'd    of  being   merely    0.38 1    appeared    to   be   absolute   and   oo^n- 
plete.  vk&thir 

Attenuation  by  errors   can  alsobe   corrected  in  another   man- 
ner,  which  has   the   great   advantage   of  an   Independent   empirical 
basis,   and   therefore   of  not   being   subject  to  either   of  the   tw;o 
above   mentioned   fallacies  besetting  the   other   method.      Hence, 
when  the   results   coincide   both  ways,    the   fallacies   In  question 
may  thereby  be   considered  as   disproved,    for   It   is  very  unlikely 
that   they  should  noth   be   present  and  in  such  proportions   as   to 
exactly  gaflfteM.  »ne   another.    In  this   method,    Instead  of  directly 
employing   the   values   pip?p,  ^    etc>  ^   ai]alganate    thern  into 

a   single    list;   by  this    -ffdns  Ve'clerl        eliminate   some    portion 
of  the    Individual   observational  errors,   and   thereby  we   cause 
any  really  existing   correspondence    to  repeal    Itself   in  greater 
completeness.      Now,   this    Increase    In  correlation   from  this   par- 
tial elimination   of  errors  will  furnish  a  measure   of  the    Increase 
to  be  expected  from  an  entire   elimination   of  errors.      Assuming 
the   mean  error  to  be   Inversely   proportional  both  to  this   in- 
crease   In  the   correlation  and    to  the   qquare   root   of  the  number 
of   lists   amalgamated,    the   relation  will  be: 
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connected  with  q'  ( or  q)  quite  independently  of  p  or  any  other 
link  connon  to  p$  Then,  the  correlation  rp'q'  will  be  to  that 
extent  increased  without  any  proportional  "increase  in  rp'p1; 

hence  our  above  formula  will  fallaciously  present  too  large  a 
value . 

▲  greater  practical  difficulty  is  that  of  obtaining  two  eerjee 
sufficiently  independent  of  one  another*  For  many  errors 
are  likely  to  repeat  themselves;   even  two  separate  observers 
are  generally,  to  some  extent,  warped  by  the  same  influences; 
we  are  all  imposed  on  by,  not  only  the  "IdOla  Specus,"  but 
also  the  "Idola  Tribus*  andthe  "Idola  Fori."  In  aoh  oases, 
the  abobe  formula  is  still  valid,  only  its  correction  does  not  g> 
quite  far  enough,*  a  fallacy  at  any  rate  on  the  right  side. 

An  actual  instance  will  best  show  the  urgent  necessity  of 
correcting  this  attenuation.   In  a  correlation  between  two 
events,  say  F  and  4,  I  obtaiad  three  independent  observations 
both  of  F  and  of  4.   The  average  correlation  for  those  of  P 
with  those  for  4  was  0.38  [-   rp'a')f  the  average  correlation 
of  those  fcr  F  with  one  another  Was  0«58  (-  rp'p');  the  same 

for  «i  was  0.22  {•   r.'  ').  therefore,  the  correspondence 
between  the  real"ev5nts,  P  and  ^  comes  by  reckoning  to 

— j  1  im0*-^     ■+   approximately  1;  so  the  the  correspondence 

instead  of  being  merely  0.38,  appeared  to  be  absolute  and 

cjjjmplete.  -     ._ 

Attenuation  by  errors  can  also  be  arrected  in  another  man- 
ner, whiah  has  the  great  advantage  of  an  independent  empirical 
basis,  and  therefore  of  not  being  subject  to  either  of  the  two 
above  mentioned  fallacies  besetting  the  other  method.   Hence, 
when  the  results  coincide  both  ways,  the  fallacies  in  question 
may  thereby  be  considered  as  disproved,  for  it  is  very  unlike  Jy 
that  they  should  both  be  present  and  in  such  proportions  as  to 
exactly  cancel  ona  another.   In  this  method,  instead  of  Urectly 
employing  the  values  p^  P2  P3  ,    etc.  we  amalgamate  them  Into 
a  single  list;  by^is  meaJlB   we  dearly  eliminate  some  portion 
of  the  individual  observational  errors,  and  thereby  we  cause 
any  really  existing  correspondence  to  reveal  itself  in  greater 
completeness.  Fow,  this  increase  in  correlation  from  this  parjr 
tial  elimination  or  errors  will  furnish  a  measure  of  the  increase 
to  be  expected  from  an  entire  elimination  of  errors.  Assuming 
the  nean  error  to  be  inversely  proportional  both  to  this  in- 
crease in  the  correlation  and  to  the  square  root  of  the  number 
of  lists  amalgamated,  the  relation  will  be:      '. 
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where  m  and  n  -   the  number  of  independent  gradinge  for  p 

and  q  respectively, 

rp'q'  Z  the   mean  correlation  between  the  various 
gradings  for  p  and  those  for  q, 

and  rp„  „  -   the  correlation  of  the  amalgamated  series 
for  p  with  the  amalgamated  series  for  q. 

In  the  above  quoted  instance,  the  three  observations  for 
series  P  were  amalgamated  into  a  single  list,  and  similarly 
those  for  series  ^.  Upon  this  being  done,  the  two  amalgama- 
ted lists  now  presented  a  correlation  with  one  another  of  no 
less  than  0.66(-  r   ^ )  .   Thus  by  this  mode  of  reckoning, 

the  real  correspondence  became 

—      y   M* V(   °'^  *  °*3Q    z   once  more  approximately  1, 

so  that  thi«  way  also  the  correspondence  advanced  from  O.38  to 
absolute  completeness. 

If  more  than  two  independent  series  of  observations  are 
available,  we  may  acquire  additional  evflence  bjr  trying  the 
effect  of  partial  amalgamtion.  Instead  of  throwing  all  our 
obtained  values  together,  we  may  form  a  set  of  smaller  combi- 
nations for  each  of  the  two  compared  characteristics,  and  then 
see  the  mean  correlation  between  one  set  and  the  other.  In 
our  obove  instance  instead  of  summarily  considering  p'i  p'2  P't. 
we  can  have  pi  pi  pi  p-J  and  pi  pi   and  find  out  their 
mean  correlation  with  •similar^ value 8  for  q.   This  works  out 
actually  to  0.55*  Hence 

4/2X2X0.55  -  0-38  - 
rBa     z  ,  .  ,     approximately  1. 

*        -4TTX2~-  1 

Thus  agAin  ,  by  this  third  may,  where  both  t  rme  are  the  maan 
of  9  observed  correlational  values,  the  correspondence  once 
more  rises  from  the  apparent  0 .38  to  the  real  1.  (1) 

3 •  Limits  of  Associative  Problems. 
We  have  seen  that  "the  length  of  the  arm  is  said  to  be  cor- 
related with  that  of  the  leg,  because  a  person  with  a  long  arm 
has  usually  a  long  leg  and  conversely;"  also  that  this  corre- 
lation is  defined  mathematic  .lly  by  any  constant  which  deter- 

(1)  The  exactnes  of4  the  coincidence  between  the  two  methods  of  cor- 
rection  is  in  the  above  instance  neither  greater  nor  less  than  genera- 
lly coours  in  practicw.  It  wa;  singled  out,  in  order  to  show  that 
the  f or-mlae  still  hold  perfectly  good  even  for  such  an  enormous 
rise  as  from  O.38  to  1.  The  possibility  of  such  a  rise  is  due  to  the 
unusual  conditions  of  the  experiment  in  question,  whereby  the  three 
observations  of  the  same  objective  series  presented  the  extra- 
ordinarily small  intercorrelation  of  0.22. 
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Mines  the  function  of  any  definite  size  of  arm  to  the  mean  cf  the  sizes  of  the 
corresponding  legs.  These  terns,  tckan  literally,  are  very  ,*ide  reaching  and 
express  what  we  will  call  the  "universal"  correlation  between  the  two  organs. 

But  evidently  not  the  most  painstaking  investigation  can 
possibly  -  cure  any  adecuately  regprosentatiV9  ranple  for  such  universal  correlations, 
even  in  the  simple  case  of  anas  and  legs.  To  begin  with,  they  would  have  to  be 
equally  dertrafi  from  s/ery  stage  of  growth,  including  th?  prenatal  period;  since 
thla  ta  bha  Host  influeicial  of  all  censes  of  variation  in  3ize.  In  the  next 

cc,  hay  vould  h-tve  to  cone  from  every  historical  epoch,  containing  their  fair 
proportion  o  '  big  Crc-Liagnons,  little  Fuifoozers,  ete.  Fu  ther,  they  must  impart- 
ially include  every  living  race, from  the  great  Patagonians  to  the  diiinutive 
M,Kabbas;  also  every  social  class,  from  the  tall  aristocrats  to  the  nndcrnized 
slumr '--rs. 

Practically, then,  the  universal  correlation,  even  if  desirable, 
la  ouite  inaccessible,  i'/e  are  forced  to  successively  introduce  a  large  number 
of  restrictions:  the  sample  is  confined  to  adults,  to  moderns,  to  some  particular 
country,  otc,  etc.  In  a  word,  ue  are  obliged  to  deal  with  a  special  correlation. 

When  we  proceed  to  more  narrowly  considers*  these  restrictions,  it  soon  becomet. 
clear  that  knap  are  Tar  iTcmi  being  really  detrimental.  Fo  every  serious  investi- 
gation will  be  found  to  So  directed,  however  vagu-ly  and  unconsciously,  by  some 
hypothesis  as  to  the  causes  both  of  the  correspondence  and  of  the  digression  there- 
from, (see  page  74).  This  hypothesis  will  determine  a  particular  system  of 
restriction,  such  as  to  set  lha  correspondence  in  the  most  significant  relief. 

But  frojn  thuj;e  restrictions  will  at  the  MM  fclwi  proceed 
several  kind*  of  grave  errors.  la  the  fist  place,  aince  the  restrictions  are  not 
explicitly  recognized,  they  often  are  not  carried  out  In  a  manner  scientific-ally 
profitable;  txay  then,  the  result,  however  true,  nay  nevertheless  be  trivial  and 
unsivge stive.  Tot  instance,  a  series  of  experiments  was  recently  executed  by  one 
of  our  best  known  psychologists  and  ended  -  to  his  apparent  satisfaction  -  in  showing 
that  so:*  children's  ;:chool-o:.c.er  was  largely,  correlated  with  their  height,  weight 
and  strength.  As,  however,  no  step  had  be-3n  taken  to  exclude  the  variations  due 
to  difference  of  ege,  the  only  reasonable  conclusion  seemed  to  be  that  as  children 
grow  oiler  they  turned  out  in  fact  to  probably  be  the  true  and  sufficient  one. 

The  next  fault  to  be  feared  is  equivocality.  For  evan  if  the  controlling  under- 
thought  '  <>.   ~ood,  yet  its  indtstioctness  1b  f  e  mind  of  the  experimenter  caused  the 
restriction  to  be  carried  cut  so  unsystematically,  that  the  results  inevitable 
kaono  become  ambiguous  and  fruitless. 

The  last  is  that,  even  with  the  clearest  purpose,  this 
•penalization  of  the  correlation  is  an  exceedingly  difficult  matter  to  execute 
successfully/  Only  by  a  profound  knowledge  of  the  many  factors  involved,  can  we 

at  all  adecuately  exclude  those  irrelevant  to  ou*  main  intention. 

St 

Now,  all  such  elements  in  a  correlation  as  are  foreign,  to  the 
iv  tor's  explicit  or  implicit  purport  will,   like  the  attuating  errors, 

constitute  imputities  in  it  and  will  quantitatively  falsify  its  appraent  amount. 
This  will  chiefly  happen  in  two  ways. 
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4.   "Gonstrijtion^M^Dilfltion." 

Any  correlation  of  either  of  the  considered  characteristics  will  have 
been  admitted   irrelevantly,   if  it  has  supervened  irrespectively  of  the  original 
definition  of  the  correspondence  to  be  investigated.     The  variations  are  thereby 
Illegitimately  constrained  to  follow  pome  irrelevant  direction  so  that  (as  in  the 
c^ae  of  Attenuation)   they  no  longer  possess  full  amplityde  of  possible  correlation 
in  the  Ixtva  ed  direction;   the  jiaximum  instead  of  beinc:  1  will  be  only  a  fraction, 

and  all  tne  leaser    tegreea  of  correspondence  will  be  similarly  affected}   such  a 
falsification  laay  be  called  "constriction.  M    Much  more  rarely,   the  converse  or 
••dilation"  aril?,  oeetcp,  by  correlations  being  irrelevantly  excluded.     The  disturbance  li 
aureoble  by  the  following  relation: 

rpej  ■ £-Pfr 


V   1     ~  r2  pr, 


where  r  r—    te     the  epparent  correlation  of  p  and  q.  the  two  variables  to  be 


^ 


compared, 


rpv       «  the  correlation  of  one  of  the  afcove  variables  with  a  third  and 
irrelevantly  ;  .dratted  variable  v. 

rp|      =■  the  real  correlation  between  p  and  q  aft  or  compensating  for  the 
ligiti'ate  influence  of  v. 

Should  any  further  irrelevant  correlation,   say  r™-    be  admitted,  then 
1 


*n  -         '.* 


V  1     ~"  r"  pv.   — r' 
In  the  reverse  case  of  "dilation," 

these  formulae  vrf.ll  bo  easily  senn  to  oe  at  once     derivable  from  the  relations 

stated  on  page*  74  tnd  75.     Shall,   irrelevant  variations  evidently  do  not  affect 

the  result  in  any  sensible  degree,  vdiile  large  ones  are  capable  of  revolutionizing  it. 

The  following  is  an  actual  illustration  of  this  constriction,     I  was 
investigating  the  correspondence  between  on  the  one  hand  intelligence  of  school  lesfom 
and  on  tho  other  the  faculty  of  discrimination  of  nusical  pitch.     The  correlation 
proved  to  be  >: .  "  .     But,  upon  inquiry,   it  turned  out  that  more  than  half  of  the 
chil'ren  took  lessons  in  nusio  and  there  fore  enjoyed  artificial,  training  as  regards 
pit oh;  here,   then,  was  a  powerful  cause  of  variation  additional  as I  quite  irrelevant  t< 
the  research,  which  dealt  with  the  correspondence  between  the  two  natural  faculties. 
When  this  diiifcurvact  had  once  boen  ditected,  there  was  no  difficulty  in  eliminating  it; 
influence  by  the  above  formula;  the  correspondence  between    )itch  kki£  discrimination  am 
iTusic  1  -  wus  i.ieasured  at  0.61;   so  that  the  true  required  correlation  became 

o.49 

+  0.62. 

V"l     -  0.61  -  • 
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In  this     particular  case,   the  more  desirable  course  was  open  of  elimination  t] 
trietion,   practically,  by  confining  the  experiment  to  those  cMldren  who  were 
learning  Nttftfl  and  t  orefcre  vnve  on  «  sufficient  equality  as  re.-ards  the  training 
ion  then  sained  ia  this  purely  empirical  way  exactly  coincided  with  the 
former  results,  being  again  0.6?,. 

Ifcereas  Attention  and  Constriction  have  wholly  tended     to  reduce  the 
apparent  correlation,   and  Dilation  to  enlarge  it,  we  now  corn©  to  a  third  kind  of 
laparitjr  that  .aay  c-qu  lly  well  reduce  or  enlarge.     Its  effects  is  thus  ar<alogous  to 
the  first  consequence  of  accidental  errors  discussed  in  the  first  part  of  this 
article,  but  unlike  the  latter,  this  Distortion  does  not  in  the  lea3t  tend  to 
alimiiiate  itself  in  the  longest  series  of  oh  serrations. 

Distortion  occurs  whenever  the  two  sories  n£  to  he  compered  together  both 
correspond  to  any  appreciable;  degree  tfltfe  the  same  thirl  irrevant  variant.     In  tbia 
case,   the  relation  is  given  by 

rl       —  r     .  rqy       Ml  (1) 

r  =  P<3  2* 

Vl-lV)     (1  ~  rV 

where  r  w«  the  apparent  correlation  between  p  and  q,  the  two  characte  istics  to 

be  comnared,  v 

* 

r^      and  r        ■  the  '•orreLations  of  pond  q  with  some  third  and  perturbing 

variable  v. 


and  r   =  the  required  real  correlation  between  p  and  q  after  compensating  for  the 
M  illigitiviate  influence  of  v. 

Should  the  common  correspondence  with  v  iiave  been  irrelevantly  excluded 
instead  cf  admit  bed,  the  relation  becomes 


r   =  r 

jpq 
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•  J  (  1  -  r"  pr  )   (1  -  r*  qr)  ♦  *pv  *<& 


In     h     course  of  the  sa*as  investigation  above  alluded  to,  but  in  another 
eohool,   the  correlation  between  school  intelligence  and  di .  iation  of  pitch  turned 

out  to  be  -  G.25,   so  mtly  not  the  cleverer  bjrt  the  stupider  children 

could  diocri  d  ate  hestl         ut  no>   it  was  obae  ved  that  a  superiority  in  discrimination 
had  been  shown  by  the  olde     children,  amount  i.vs  to  a  corre3aticn  of  0.55;  while,   for 
a  then  uuk'.iown  reason,  the  schoolmaster's  estimate  of  intcOJifrce  had  shoen  a  very 

■  :od  (though  unconscious)   partiality  for  the  younger  onez,,  amounting  <>o  a  con-elation 
of  0.65.     Banes  the  time     correlation  reckoned  cut  to 

-  0.25     -  0.55     X   j-  0.65) 


Y(l-  0.55s      )        (1  -  -     0.65       ^   ) 

=     +  D«17.     Ihis  latter  low  but  direct  correlation  was  -  under  the  particular 

1.  •  38  oame  formular  has  already  been  arrived  at,  ttmagh  though  along  a  very 
different  route,  by  Yule.     See  Iroo.  R.S.L.  Vol.  IX. 
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circumstances  of  the  experiment  -  unquestionably  about  corr&ct;   go  that  tne  one 
originally  observed  of  =  -0.;j5  would  have  been     entirely  misleading* 

6.     Criticism,  o  '  Srevelent  forking  Vethods. 

So  far,   our  illustration  of  systematic  deviation  has  been  cor.f  rned  to  instancee 
taken  from  personal  experience.     3ut  it  might  perhaps  be  thought  that  other  workers 
avcxd  such  perversions  of  fact  by  the  simpler  method  of  common  sense.     Unfo  utnately, 
such  does  not  seam  to  have  been  at  all  the  case;  not  once,   to  the  best  of  my 
knowledge,  has  any  partial  association  between  two  psychological  events  been 
determined  in  such  a  way  as  to  present  any  good  evidential  value  -  th^se  are 
strong  terms,  but  I  think,  hardly  exaggerated. 

Psychologists,  i/ith  scarcely  an  exception,  never  seem  to    have  become 
acquainted  with  the  brilliant  work  bslag  carried  on  since    1R86  by  the  fjalton- 
Pearson  school.     The  consequence  has  been  that  they  do  not  even  attain  to  the 
first  fundamental  requisite  of  ccrre3.ation,  nanely,   s  preeise  quantitative 
expression.     Many  havs,   indeed,   taken  great  pains  in  the  matter  and  have  construct- 
ed arrays  of  complicated  numerical  tables}  but  when  we  succeed  in  orienting  our- 
selves in  the  somewhat  bewildering  assemblage  of  figures,  we  generally  find  that 
they  have  omitted  precisely  th     few  facts  which  are  essential,   so  that  we  cannot 
even  work  out  the  correlation  for  ourselves. 

This  lack  of  quantitaive  expression  entails  fa*  more  than  merely  diminished 
exactitude.     For  in  consequence,  the  expreimenters  have  been  unable  to  estimate 
their  own  results  at  all  carreotly,   sane  have  believed  themselves  to  demonstrate 
an  entise  absence  of  sorrespondeaee,  when  the  latter  has  really  been  quite  con- 
siderable} -Hhereas  others  have  presented  to  the  public  as  a  high  correlation  what 
ha3  really  been  very  small  sad  often  well  within  the  limits  of  mere  accidental 
coincidence}  these  limits  they  have  had  no  ma  nn  of  determining,   anS    moreover 
their  data  were  usually  obtained  in  suah  a  way  a3  to  make  it  unnecessarily  large. 

Seeing,  thus,   that  even  the  elementary  requirements  of  food  correlation 
wfcrk  described  in  the  first  pert  of     his  article  have  been  so  generally  deficient 
we  cannot  be  surprised  to  find  that  the  moreadiiunced  refinements  of  procedure 
discussed  in  the  second  part  havs  been  almost  wholly  tmr#gerded|   so  that  the 
final  results  are  saturated  and  falsified  with  every  description  of  impurity. 
In  this  respect,  unfortunately,   it  i3  no  longer  possible  to  hold  up  even  the 
^talton-Pearnon  school  as  a  model  to  >>e  1  1.     The  letter  must    iow.  perform  the 

very  dMBfetftt  office  of  saving  us  from  detailed  criticism  of  inferior  work,  by 
•mob ling  us  to  form  an  opinion  as  be  how  ttrtfar  the  defect  permeates  and  vitiates 
even  the  ":.est  exintont  correlational  research. 

Aa  sxsj&gle,  ws  will  take  :fo .-:■■" son's  chief  line  of  invstigetion,   Collateral 
He  edity,  at  that  point  where  it  ccr.es  into  sleseet  co  tact  wltfe  our  own  topic, 
Psychology.     3ince  1089  he       9,  :tion  anil  assistance,  been 

•  a  vast  number  of  data  as  to  the  amount  of  correspondence  existing  between 
brother**     A  perliminary  sale  ilatlen,   based  in  'v>oh  eone  upon  600  to  1,000  pairs, 
led,   Is  1910  to  the  publication  of  the  following  momentous  results: 
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Daaling  with  the  moans  f  steal  find  mental  characters,  Wl  ero  forced 

iioral  nature  la,  quite  as  r.iicL  aj  out  physical  nature,  the  PHttum  of  heredity 
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How  let  ue  ooaeldep  hew  these  scoff  iejentit  of 
correlation  will  be  effected  by  our  "systematic  deviations. "    To  begin  with 
too  ''Mtenuation"  by  errorej  sines  it  evi&er.tly  cannot  be  cseunod 
that  the  schoolmasters*  judgwsnts  ae  to  conscientiousness,  temper,  etc.  ere 
absolutely  '•*  frllible.     On  :?n&  90,  it  fees  teen  ahown  that  deviation  frcsi 
this  noureo  cay  he  estlnetet.  by  the  i'ollesrtn«  do  rtiist 
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1  am  sware  of  no  precise  data 


beyond 


that  found  ir  oorae  «ixp  ssm,  t.f.ere  it  intollectual 

grading*  for  the  sai*  eorlon  of  etibjoete  con-elated  with  one  another  on  an 
average  to  the  amount  of  -•&£•     A.a  on  other  ocaaciona  very  competent  poisons 
here  estimated  this  to  be  as  zaioh  as  should  he  expected,  as  as  intellsaence  is 
about  i.he  most    easily  gradablo  of  all  ohe  dental  qualities  mentioned  by  Boarson,  t 
there  is  so  for  no  r  io  supposo  thnt  '  ■    ' ■      .  "ber  of  rasters  and 

id  on  ,o,  ever  Rssl  that 

the  mistaiws  in  eatijnntfcig  one  brother  were  independent  of  the  air-stakes  in 
other,  thoa  the  true  correlation  so  .Id  be  oJ  otot,  not  0.5172, 
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0*01,  an   extort  of  -.1  iff  erenow  that  seriously 


V  6.04  X  0.C4 
codifies  our  liinnssflws  of  exactitude  from  all  these  coefficients  to  four 
places  of  decimals,   'hen  wb  further  consider  thet  eaoh  of  these  physical  and 
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mental  heredity  can  hardly  be  more  than  mere  accidental  coincidence. 

Let  us  next  proceed  to  irrelevant  correlation,   and  take  for  our 
theme  postnatal  accidents  connected  on  the  one  side  with  brotherhood  and  on  the 
other  with  the  mental  qualities.     Pearsonfs     primary  intention  seems  to  have  been 
to    lake  his  correlations  as  "universal"  as  possible  and  in  one  place  he  expressly 
mentions  that  education  is  among  the  causes  contributory  to  variation.     Kence,  he  is 
more  than  consistent,   in  that  he  forms  his  correlation  without     regard  to  the  fact 
that  the  HMEEsiatiax  correspondence  between  the  brothers'   "conscientiousness," 
"popularity,'1  etc.,  must  be  in  gnsrt  measure  due  to  their  coming  under  the  same 
home  influences.     But  such  a  correlation  can  scarcely  be  accepted  as  scientifically 
valuable,     Fo     we  do  not  really  know  anything  precise  about  the  assimilating  effects 
of  heredity,  when  our  observed  correspondence  is  perhaps  chiefly  clue  to  the  brothers 
having  tin  amount  of  campers  and  pocket-money.       Still  less  can  we,  then, 

fairly  compare  such  results  with  that  Obtained  from  physical  ~r>.easureinentsr  where 
consnom  home  life  has  little  or  no  effect.     The  factor  of  post-ratal  accidents, 
therefore,   cannot  but  be  regarded  as  irrelevant,   and  consequently  the  coefficients 
of  correlation  must  be  taken  as  hope  ^s sly  "distorted." 

But  even    -onsistfarce  cannot  be  ughald  throughout  the  matter.     -For 
though  xbe  effect  of  postnatal  life  has  thus  been  admitted  with     regard  to  education 
at  home,  it  has  perforce  been  excluded  as  regards  public  education.     For  only  those 
brothers  have  been  compared  together  who  are  at  the  same  school;   the  coefficients 
of  correlation  would  certainly  diminish  if  those  also  could  be  included  who  are 
living  in  a  totally  different  manner,  hatoe  gone  to  see,  etc.,     The  correlations  are 
therefore  also  illegitimately  "dilated." 

If  this  work  of  Pearson  has  thus  been  tingled  out  for  criticism, 
it  is  certainly  from  no  desire  to  undervalue  it.     The  above  anfi  any  other  systematic 
errors  are  eventually  capable  or  adequate  elimination,   and  this  orticla  has  itself, 
it  is  hoped  been  of  some  use  towards  that  purpose.     Such  correction  will  no  doubt 
necessitate  an  i' mouse  amount  of  further  investigation  and  labour,  but  in  the  end 
his  results  will  acquire  all  their  proper  validi  y.     My  present  object  is  only  to 
gu  rd  against  premature  conclusions  and  to  point  out  the  urgent  need  os  still 
further  i:  proving  the  existing  nethodiC3  of  correlation  work,   a  mfclihod  cf  investig- 
ation which  he  mims-  If  has  so  largely  helped  to  cx-eate  and  by  means  of  which  he  is 
carrying  lipht   into  immense  regions  hitherto  buired  inthe  obscurity  cf  irresponsible 
speculation.     The  fundamental  difference  between  Lis  prodedure  and  that  here  rs- 
ecBmended,  is  that  he  seeks  large  natural  samples  of  any  existinc  aa*le>l  sufficient- 
ly homogeneous  to  be  treated  mathematically}  whereas  here  seller  samples  ere 
deemed  sufficient,     ut  they  are  required  to  be  artificially  selected,   oreVtreA,  and 
corrected  into  full  scientifio  significance,     ^lie  methods  are  those  of  pure  statistics 
those  inculcated  bore  nay  bo  nore  aptly  termed  "atatistiooids." 

7.     Ullfliai  of  Case3  Desirable  for  an  Experiment. 

This  leads  us  to  the  important  question,   as  to  how  many  cases  it 
it  advisable  to  collect  for  a  si  ries  of  experiments.      In  actual  oractiee, 

the  at  diversity  has  been  apparent  in  this  respect;  many  have  thought  to 

'identity  .establish  important  correlations  with  loss  than  ten  experimental 
subjects  while  others  ha'oe  thought  it  necessary  to  gather  together  at  least  over 
a  thousand. 


«l 


»:. 


28. 

Now,  a  scries  of  experiments  is  a  v;  rlruct,  whose 

disposition  is,   n.jvert.  elece,  to  be  accepted  aa  a  fiar  sanple  of     the  whole  irraent 
reminder.      Other  things  equal,   then,   tho  1  rger  the  3ampia  the  t;ro-.  tor  its 
evidential  value  and  the  loss  chance  of  a  more  occasional  coincidence  being 
mistaken  for  the  rwmanont  universal  teadeney. 

TkiM  danger  of  accidental  deviation  ha.3  been  discun-ed  in  the 
flrtt   pari  %erc  ehown  to  bi  t-.trictly  neuaurable  by  the  "prchc_ble  error." 

We  th«re  saw,  a  loo,  that  this  flnnfler  eaa  never  be  entirely  elirdnated  by  any  sample 
however  large,   so  that   it  ia  necesary  to  accept  some  standard  less  rigorous  than 
absolute  certainty  as  sufficient  for     all  practical  purposes;  usually,  the  danger 
of  mere  eidoaoa  is  considered  to  hs  inapprociable  when  a  corrc-lation 

is  observed  as  much  as  five  uines  greater  than  the  probable  error,   asaing  that 
mere  chance  would  not  produce  thia  once  is  l  I  raS  tines.     '."ence,   evidently, 

the  accidontal  deviation     depends,  not  only  on  the  number  cf  caes,  but  also  on  the 
largeness  of  the  really  existing  correopcnder.ee;  t£e  mere  perfect  the  latter,  the 
fewer  th<    case:;-  that  will  be  required  to  demonstrate  it  eeaeluaivelyi  and  this 
tendency  is  augmented  by  the  fact  fast  the    robable  error,  besides  varying  inversely 
with  "n",   doeB  so  to  a  further  axtent  with  "r"   (seG  i'oriu;la).     .ft  was  shown  in 
V  o  sa:-ss  part  that  the  size  of  the  probable  error  also  varies  accordingly  to  the 
method  of  calculation-  and  bo  juch  an    ixtut  that  t'senty  casoa  treated  in  one 
of  the  v/ays  described  furnishes  as  nuch  certitude  as  ItiO  in  another  more  usual 
way.     If  the  comuon  tlifold  el  cation  be  adopted,   an  OTOn  or  number 

ia  reqa^reJ  to  effect  the  same  purpose;   and  if  the  correlation  be  not  calculated 
quantitatively  at  all,  but  instead  be  presented  in  the  customary  fashion  to  the 
reader's  gOMOral  impression,  tl  en  no  mnabor  of  cases  whatever  apprear  sufficient 
to  .live  reasonable  guarantee  of  proof. 

While   :his  the  •  of   m-.h  jests  is  not  by  ■  jy       fcfca  the  sole 

car  in  liadnli  i  accidental  deviation  it  has  no  offset  whatever 

upon  tho  far  r.ore  formidable  systematic  3e  iation  except  that  it  indireotly 
's  to  an  taozsaraa  amgoontatic^         •  of  .     When  we  are  taJtiag  great  pains  to 

be  abl3  to  show  upon  paper  aa  Inpoalng  nutuber  of  ca3se  and  a  ive  probable 

error,  xs  are  in  the  self  saix  provess  :aost  lihely  introducing  a  systematic 

deviation  twenty  tines  greater. 

From  this  we  ther  th  number  o     case3  should  be 

totoraiaod  by  the  simple  principle,  that  the  measurements  to  ho  abrogated  to- 
gether should  have  the±r  error  brought  to  the  same  general     ordsr  of  magnitude. 
An  astronoiric_l  chroncrieter,  with  npring-detont  excaperrait,   ie  net    .he  best 
travailing  ^loch;    .or  is  there  ai  1  advantage  in  grafting  njoa  a  milestone 

(aa  has  actually  been  done  by  an  lnfatn  optician!)  the  distance  to  the 

•ost  Tillage  i  o  three  daoinal  plaeoo.     How,  tha  present  s'-.ago  of 

Correctional  Psychology  is  one  of  pioneering;  and,   instead  of  e  few  unwieldy 
expcrir.'ota,  we  re  uire  a  lar  of  snail  ones  careful ':.y  carried  out  under 

varied  1.1  considered  conditions.     At  the  time,  however,   the  probable  errox 

i  to  limits  at  any  rata  oaall  enough  for  the  particular  object  of 
ion  to  be  proved-*     Wtrr  euch  a  purpose  a  probable  error  may  at  present  be 
admitted  without  uoah  hesitation  up  to  about  0.05;  so  that,  by  adopting  the  method 
of  calculation  recommondod,  two  to  three  dozen  subjects  should  be  sufficient  for 

t  purposes.     Tho  precision  can  always  be  augmented  subs  ly,  by  carrying 

our  aijrJLlar  experiments  under  sinllar  conditions  and  then  taking  averages.     Only 
after  a  long  preliminary  exploration  of  thi3  rougher  sort,   small  we  b$r  in  a 
position  to  effectually  utilize  0X1  ks  designed  and  executed  from  the  very 

beginning  on  a  vast   scale. 
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